Overview

Dataset Statistics

Number of Variables 34
Number of Rows 501951
Missing Cells 0
Missing Cells (%) 0.0%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 320.5 MB
Average Row Size in Memory 669.5 B
Variable Types
  • Categorical: 21
  • Numerical: 13

Dataset Insights

person_prefer_d_1 is skewed Skewed
person_prefer_d_2 is skewed Skewed
person_prefer_d_3 is skewed Skewed
person_prefer_e is skewed Skewed
person_prefer_h_1 is skewed Skewed
person_prefer_h_2 is skewed Skewed
contents_attribute_d is skewed Skewed
contents_attribute_e is skewed Skewed
contents_open_dt has a high cardinality: 494952 distinct values High Cardinality
person_prefer_f has constant value "1" Constant
person_prefer_g has constant value "1" Constant
person_attribute_a has constant length 1 Constant Length
person_attribute_a_1 has constant length 1 Constant Length
person_attribute_b has constant length 1 Constant Length
person_prefer_c has constant length 1 Constant Length
person_prefer_f has constant length 1 Constant Length
person_prefer_g has constant length 1 Constant Length
contents_attribute_i has constant length 1 Constant Length
contents_attribute_a has constant length 1 Constant Length
contents_attribute_j has constant length 1 Constant Length
contents_attribute_c has constant length 1 Constant Length
contents_attribute_k has constant length 1 Constant Length
contents_attribute_m has constant length 1 Constant Length
contents_open_dt has constant length 19 Constant Length
target has constant length 1 Constant Length
person_prefer_e has 66676 (13.28%) zeros Zeros
  • 1
  • 2
  • 3

Variables

d_l_match_yn

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 33.2 MB
  • The largest value (True) is over 1.69 times larger than the second largest value (False)

Length

Mean 4.3719
Standard Deviation 0.4833
Median 4
Minimum 4
Maximum 5

Sample

1st row True
2nd row False
3rd row False
4th row False
5th row True

Letter

Count 2194487
Lowercase Letter 1692536
Space Separator 0
Uppercase Letter 501951
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (True, False) take over 50.0%
  • The largest value (true) is over 1.69 times larger than the second largest value (false)

d_m_match_yn

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 33.4 MB
  • The largest value (False) is over 2.76 times larger than the second largest value (True)

Length

Mean 4.7338
Standard Deviation 0.442
Median 5
Minimum 4
Maximum 5

Sample

1st row True
2nd row False
3rd row False
4th row False
5th row True

Letter

Count 2376128
Lowercase Letter 1874177
Space Separator 0
Uppercase Letter 501951
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (False, True) take over 50.0%
  • The largest value (false) is over 2.76 times larger than the second largest value (true)

d_s_match_yn

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 33.4 MB
  • The largest value (False) is over 5.67 times larger than the second largest value (True)

Length

Mean 4.8501
Standard Deviation 0.357
Median 5
Minimum 4
Maximum 5

Sample

1st row True
2nd row False
3rd row False
4th row False
5th row True

Letter

Count 2434498
Lowercase Letter 1932547
Space Separator 0
Uppercase Letter 501951
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (False, True) take over 50.0%
  • The largest value (false) is over 5.67 times larger than the second largest value (true)

h_l_match_yn

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 33.1 MB
  • The largest value (True) is over 3.89 times larger than the second largest value (False)

Length

Mean 4.2044
Standard Deviation 0.4033
Median 4
Minimum 4
Maximum 5

Sample

1st row False
2nd row True
3rd row True
4th row True
5th row False

Letter

Count 2110417
Lowercase Letter 1608466
Space Separator 0
Uppercase Letter 501951
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (True, False) take over 50.0%
  • The largest value (true) is over 3.89 times larger than the second largest value (false)

h_m_match_yn

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 33.3 MB
  • The largest value (False) is over 1.74 times larger than the second largest value (True)

Length

Mean 4.6352
Standard Deviation 0.4814
Median 5
Minimum 4
Maximum 5

Sample

1st row False
2nd row True
3rd row False
4th row False
5th row False

Letter

Count 2326622
Lowercase Letter 1824671
Space Separator 0
Uppercase Letter 501951
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (False, True) take over 50.0%
  • The largest value (false) is over 1.74 times larger than the second largest value (true)

h_s_match_yn

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 33.4 MB
  • The largest value (False) is over 2.67 times larger than the second largest value (True)

Length

Mean 4.7278
Standard Deviation 0.4451
Median 5
Minimum 4
Maximum 5

Sample

1st row False
2nd row False
3rd row False
4th row False
5th row False

Letter

Count 2373126
Lowercase Letter 1871175
Space Separator 0
Uppercase Letter 501951
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (False, True) take over 50.0%
  • The largest value (false) is over 2.67 times larger than the second largest value (true)

person_attribute_a

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 31.6 MB
  • The largest value (1) is over 1.94 times larger than the second largest value (2)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 2
4th row 2
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 501951
  • The top 2 categories (1, 2) take over 50.0%
  • The largest value (1) is over 1.94 times larger than the second largest value (2)
  • person_attribute_a has words of constant length

person_attribute_a_1

categorical

Approximate Distinct Count 8
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 31.6 MB
  • The largest value (0) is over 2.59 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 4
2nd row 3
3rd row 0
4th row 0
5th row 3

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 501951
  • The largest value (0) is over 2.59 times larger than the second largest value (1)
  • person_attribute_a_1 has words of constant length

person_attribute_b

categorical

Approximate Distinct Count 6
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 31.6 MB
  • The largest value (2) is over 1.61 times larger than the second largest value (3)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 3
2nd row 4
3rd row 3
4th row 2
5th row 4

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 501951
  • The top 2 categories (2, 3) take over 50.0%
  • The largest value (2) is over 1.61 times larger than the second largest value (3)
  • person_attribute_b has words of constant length

person_prefer_c

categorical

Approximate Distinct Count 5
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 31.6 MB
  • The largest value (1) is over 1.94 times larger than the second largest value (5)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 5
2nd row 1
3rd row 5
4th row 5
5th row 5

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 501951
  • The top 2 categories (1, 5) take over 50.0%
  • The largest value (1) is over 1.94 times larger than the second largest value (5)
  • person_prefer_c has words of constant length

person_prefer_d_1

numerical

Approximate Distinct Count 1093
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 7.7 MB
Mean 537.2964
Minimum 4
Maximum 1258
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • person_prefer_d_1 is skewed right (γ1 = 0.28)

Quantile Statistics

Minimum 4
5-th Percentile 95
Q1 117
Median 453
Q3 947
95-th Percentile 1227
Maximum 1258
Range 1254
IQR 830

Descriptive Statistics

Mean 537.2964
Standard Deviation 411.4419
Variance 169284.4583
Sum 2.697e+08
Skewness 0.28
Kurtosis -1.5011
Coefficient of Variation 0.7658
  • person_prefer_d_1 is not normally distributed (p-value 3.583987816255689e-15)

person_prefer_d_2

numerical

Approximate Distinct Count 1081
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 7.7 MB
Mean 545.8339
Minimum 4
Maximum 1258
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • person_prefer_d_2 is skewed right (γ1 = 0.2337)

Quantile Statistics

Minimum 4
5-th Percentile 95
Q1 151
Median 464
Q3 968
95-th Percentile 1189
Maximum 1258
Range 1254
IQR 817

Descriptive Statistics

Mean 545.8339
Standard Deviation 403.3287
Variance 162674.074
Sum 2.7398e+08
Skewness 0.2337
Kurtosis -1.542
Coefficient of Variation 0.7389
  • person_prefer_d_2 is not normally distributed (p-value 8.154058472527736e-08)

person_prefer_d_3

numerical

Approximate Distinct Count 1043
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 7.7 MB
Mean 534.9941
Minimum 4
Maximum 1258
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • person_prefer_d_3 is skewed right (γ1 = 0.2726)

Quantile Statistics

Minimum 4
5-th Percentile 95
Q1 136
Median 452
Q3 929
95-th Percentile 1227
Maximum 1258
Range 1254
IQR 793

Descriptive Statistics

Mean 534.9941
Standard Deviation 415.7521
Variance 172849.7852
Sum 2.6854e+08
Skewness 0.2726
Kurtosis -1.5734
Coefficient of Variation 0.7771
  • person_prefer_d_3 is not normally distributed (p-value 3.675881135657355e-08)

person_prefer_e

numerical

Approximate Distinct Count 12
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 7.7 MB
Mean 3.6263
Minimum 0
Maximum 11
Zeros 66676
Zeros (%) 13.3%
Negatives 0
Negatives (%) 0.0%
  • person_prefer_e is skewed left (γ1 = -0.1338)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 3
Median 4
Q3 5
95-th Percentile 6
Maximum 11
Range 11
IQR 2

Descriptive Statistics

Mean 3.6263
Standard Deviation 1.8467
Variance 3.4104
Sum 1.8202e+06
Skewness -0.1338
Kurtosis 1.2826
Coefficient of Variation 0.5093
  • person_prefer_e is not normally distributed (p-value 8.809743661476187e-15)
  • person_prefer_e has 4861 outliers

person_prefer_f

categorical

Approximate Distinct Count 1
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 31.6 MB

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 1
4th row 1
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 501951
  • person_prefer_f has words of constant length

person_prefer_g

categorical

Approximate Distinct Count 1
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 31.6 MB

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 1
4th row 1
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 501951
  • person_prefer_g has words of constant length

person_prefer_h_1

numerical

Approximate Distinct Count 279
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 7.7 MB
Mean 116.3949
Minimum 2
Maximum 313
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • person_prefer_h_1 is skewed right (γ1 = 0.6152)

Quantile Statistics

Minimum 2
5-th Percentile 4
Q1 43
Median 95
Q3 190
95-th Percentile 285
Maximum 313
Range 311
IQR 147

Descriptive Statistics

Mean 116.3949
Standard Deviation 91.033
Variance 8287.0051
Sum 5.8425e+07
Skewness 0.6152
Kurtosis -0.8422
Coefficient of Variation 0.7821
  • person_prefer_h_1 is not normally distributed (p-value 2.0933311027384922e-10)

person_prefer_h_2

numerical

Approximate Distinct Count 279
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 7.7 MB
Mean 136.012
Minimum 2
Maximum 313
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • person_prefer_h_2 is skewed right (γ1 = 0.2864)

Quantile Statistics

Minimum 2
5-th Percentile 4
Q1 59
Median 116
Q3 227
95-th Percentile 281
Maximum 313
Range 311
IQR 168

Descriptive Statistics

Mean 136.012
Standard Deviation 93.7562
Variance 8790.2266
Sum 6.8271e+07
Skewness 0.2864
Kurtosis -1.2472
Coefficient of Variation 0.6893
  • person_prefer_h_2 is not normally distributed (p-value 3.407716695560223e-08)

person_prefer_h_3

numerical

Approximate Distinct Count 279
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 7.7 MB
Mean 122.7847
Minimum 2
Maximum 313
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • person_prefer_h_3 is skewed right (γ1 = 0.5968)

Quantile Statistics

Minimum 2
5-th Percentile 4
Q1 59
Median 95
Q3 199
95-th Percentile 288
Maximum 313
Range 311
IQR 140

Descriptive Statistics

Mean 122.7847
Standard Deviation 90.9479
Variance 8271.5235
Sum 6.1632e+07
Skewness 0.5968
Kurtosis -0.9102
Coefficient of Variation 0.7407
  • person_prefer_h_3 is not normally distributed (p-value 1.9183702749843002e-06)

contents_attribute_i

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 31.6 MB
  • The largest value (3) is over 2.93 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 3
2nd row 1
3rd row 3
4th row 1
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 501951
  • The top 2 categories (3, 1) take over 50.0%
  • The largest value (3) is over 2.93 times larger than the second largest value (1)
  • contents_attribute_i has words of constant length

contents_attribute_a

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 31.6 MB
  • The largest value (3) is over 2.18 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 3
2nd row 3
3rd row 1
4th row 3
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 501951
  • The top 2 categories (3, 1) take over 50.0%
  • The largest value (3) is over 2.18 times larger than the second largest value (1)
  • contents_attribute_a has words of constant length

contents_attribute_j_1

categorical

Approximate Distinct Count 9
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 31.7 MB
  • The largest value (5) is over 3.7 times larger than the second largest value (10)

Length

Mean 1.1779
Standard Deviation 0.3824
Median 1
Minimum 1
Maximum 2

Sample

1st row 10
2nd row 5
3rd row 10
4th row 5
5th row 10

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 591258
  • The top 2 categories (5, 10) take over 50.0%
  • The largest value (5) is over 3.7 times larger than the second largest value (10)

contents_attribute_j

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 31.6 MB
  • The largest value (1) is over 3.21 times larger than the second largest value (2)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 2
2nd row 1
3rd row 2
4th row 1
5th row 2

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 501951
  • The top 2 categories (1, 2) take over 50.0%
  • The largest value (1) is over 3.21 times larger than the second largest value (2)
  • contents_attribute_j has words of constant length

contents_attribute_c

categorical

Approximate Distinct Count 4
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 31.6 MB
  • The largest value (1) is over 4.84 times larger than the second largest value (3)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 1
4th row 1
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 501951
  • The top 2 categories (1, 3) take over 50.0%
  • The largest value (1) is over 4.84 times larger than the second largest value (3)
  • contents_attribute_c has words of constant length

contents_attribute_k

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 31.6 MB
  • The largest value (2) is over 26.25 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 2
2nd row 2
3rd row 1
4th row 2
5th row 2

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 501951
  • The top 2 categories (2, 1) take over 50.0%
  • The largest value (2) is over 26.25 times larger than the second largest value (1)
  • contents_attribute_k has words of constant length

contents_attribute_l

numerical

Approximate Distinct Count 1752
Approximate Unique (%) 0.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 7.7 MB
Mean 1030.8632
Minimum 1
Maximum 2013
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • contents_attribute_l is skewed right (γ1 = 0.1343)

Quantile Statistics

Minimum 1
5-th Percentile 204
Q1 597
Median 953
Q3 1582
95-th Percentile 1847
Maximum 2013
Range 2012
IQR 985

Descriptive Statistics

Mean 1030.8632
Standard Deviation 527.2357
Variance 277977.5187
Sum 5.1744e+08
Skewness 0.1343
Kurtosis -1.1441
Coefficient of Variation 0.5115
  • contents_attribute_l is not normally distributed (p-value 1.2545338485507914e-05)

contents_attribute_d

numerical

Approximate Distinct Count 1065
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 7.7 MB
Mean 581.5052
Minimum 4
Maximum 1258
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • contents_attribute_d is skewed right (γ1 = 0.1407)

Quantile Statistics

Minimum 4
5-th Percentile 102
Q1 138
Median 587.22
Q3 975
95-th Percentile 1227
Maximum 1258
Range 1254
IQR 837

Descriptive Statistics

Mean 581.5052
Standard Deviation 413.9158
Variance 171326.2531
Sum 2.9189e+08
Skewness 0.1407
Kurtosis -1.5378
Coefficient of Variation 0.7118
  • contents_attribute_d is not normally distributed (p-value 9.760511599752316e-14)

contents_attribute_m

categorical

Approximate Distinct Count 5
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 31.6 MB
  • The largest value (1) is over 3.16 times larger than the second largest value (4)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 1
4th row 5
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 501951
  • The top 2 categories (1, 4) take over 50.0%
  • The largest value (1) is over 3.16 times larger than the second largest value (4)
  • contents_attribute_m has words of constant length

contents_attribute_e

numerical

Approximate Distinct Count 12
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 7.7 MB
Mean 3.923
Minimum 0
Maximum 11
Zeros 1025
Zeros (%) 0.2%
Negatives 0
Negatives (%) 0.0%
  • contents_attribute_e is skewed right (γ1 = 1.064)

Quantile Statistics

Minimum 0
5-th Percentile 3
Q1 3
Median 4
Q3 4
95-th Percentile 6
Maximum 11
Range 11
IQR 1

Descriptive Statistics

Mean 3.923
Standard Deviation 1.16
Variance 1.3456
Sum 1.9691e+06
Skewness 1.064
Kurtosis 3.2692
Coefficient of Variation 0.2957
  • contents_attribute_e is not normally distributed (p-value 8.936500749488639e-17)
  • contents_attribute_e has 47632 outliers

contents_attribute_h

numerical

Approximate Distinct Count 250
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 7.7 MB
Mean 132.5309
Minimum 5
Maximum 311
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • contents_attribute_h is skewed right (γ1 = 0.5172)

Quantile Statistics

Minimum 5
5-th Percentile 22
Q1 60
Median 118
Q3 199
95-th Percentile 288
Maximum 311
Range 306
IQR 139

Descriptive Statistics

Mean 132.5309
Standard Deviation 87.1423
Variance 7593.782
Sum 6.6524e+07
Skewness 0.5172
Kurtosis -0.973
Coefficient of Variation 0.6575

person_rn

numerical

Approximate Distinct Count 300177
Approximate Unique (%) 59.8%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 7.7 MB
Mean 514111.5336
Minimum 7
Maximum 1.049e+06
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • person_rn is skewed right (γ1 = 0.1085)

Quantile Statistics

Minimum 7
5-th Percentile 70585
Q1 257796.04
Median 498207
Q3 763745.12
95-th Percentile 997413
Maximum 1.049e+06
Range 1.049e+06
IQR 505949.08

Descriptive Statistics

Mean 514111.5336
Standard Deviation 294354.7432
Variance 8.6645e+10
Sum 2.5806e+11
Skewness 0.1085
Kurtosis -1.1576
Coefficient of Variation 0.5726

contents_rn

numerical

Approximate Distinct Count 283359
Approximate Unique (%) 56.5%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 7.7 MB
Mean 337674.3451
Minimum 20
Maximum 753628
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • contents_rn is skewed right (γ1 = 0.2684)

Quantile Statistics

Minimum 20
5-th Percentile 32551.7
Q1 142255
Median 307588.02
Q3 533687
95-th Percentile 708285.9
Maximum 753628
Range 753608
IQR 391432

Descriptive Statistics

Mean 337674.3451
Standard Deviation 219518.4856
Variance 4.8188e+10
Sum 1.695e+11
Skewness 0.2684
Kurtosis -1.2005
Coefficient of Variation 0.6501
  • contents_rn is not normally distributed (p-value 0.004299202761138759)

contents_open_dt

categorical

Approximate Distinct Count 494952
Approximate Unique (%) 98.6%
Missing 0
Missing (%) 0.0%
Memory Size 40.2 MB

Length

Mean 19
Standard Deviation 0
Median 19
Minimum 19
Maximum 19

Sample

1st row 2020-01-17 12:09:3...
2nd row 2020-06-18 17:48:5...
3rd row 2020-07-08 20:00:1...
4th row 2020-01-13 18:09:3...
5th row 2020-03-09 20:39:2...

Letter

Count 0
Lowercase Letter 0
Space Separator 501951
Uppercase Letter 0
Dash Punctuation 1003902
Decimal Number 7027314
  • contents_open_dt contains many words: 80416 words
  • contents_open_dt has words of constant length

target

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 31.6 MB

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 501951
  • The top 2 categories (0, 1) take over 50.0%
  • target has words of constant length

Interactions

Correlations

Missing Values